A Combination of Similarity and Rule-based Method of PolyU for NTCIR-12 STC Task

Authors

  • Chuwei Luo
  • Wenjie Li
Abstract

In this report, we describe the approach we used in the NTCIR-12 Short Text Conversation task. Because we registered for the task late and had less than one week to work on it, we designed a simple approach based on the cosine similarity of sentences and some handcrafted rules. The results show the effectiveness of our method.
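
The abstract does not say how sentences are vectorized or what the handcrafted rules are. As a rough illustration only, the sketch below ranks candidate comments by cosine similarity over bag-of-words term counts. The whitespace tokenization and the names `cosine_similarity` and `rank_candidates` are assumptions made for this example, not details from the report; Chinese text would first need word segmentation, and the rule-based part is omitted because it is not described.

```python
import math
from collections import Counter


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words count vectors.

    Assumption: both inputs are already segmented into
    whitespace-separated tokens.
    """
    va, vb = Counter(a.split()), Counter(b.split())
    # Dot product over the tokens the two sentences share.
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty sentence is similar to nothing
    return dot / (norm_a * norm_b)


def rank_candidates(query: str, candidates: list[str]) -> list[tuple[str, float]]:
    """Return (candidate, score) pairs sorted by descending similarity."""
    scored = [(c, cosine_similarity(query, c)) for c in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Calling `rank_candidates(query, candidates)` returns the candidates ordered from most to least similar to the query; any handcrafted rules would be applied as a separate filtering or re-ranking step on top of this list.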

Similar resources

Nders at the NTCIR-12 STC Task: Ranking Response Messages with Mixed Similarity for Short Text Conversation

Short Text Conversation (STC) is a typical scenario in man-machine conversation, which simplifies the conversation into a one-round interaction and makes the related tasks more practical. This paper presents a simple approach to the Chinese STC task issued by NTCIR-12. Given a repository of post-comment pairs, for any query, we define three types of similarity and merge them according to empirica...

ICL00 at the NTCIR-12 STC Task: Semantic-based Retrieval Method of Short Texts

We take part in the short text conversation task at NTCIR-12. We employ a semantic-based retrieval method to tackle this problem, by calculating textual similarity between posts and comments. Our method applies a rich-feature model to match post-comment pairs, by using semantic, grammar, n-gram and string features to extract high-level semantic meanings of text.

Microsoft Research Asia at NTCIR-12 STC Task

This paper describes our approaches to the NTCIR-12 short text conversation (STC) task (Chinese). For a new post, instead of considering post-comment similarity, our system focuses on finding similar posts in the repository and retrieving their corresponding comments. We also use the frequency of comments to adjust the ranking models. Our best run achieves 0.4854 for mean P, 0.3367 for mean n...

USTC at NTCIR-12 STC Task

In this paper, we describe the system submitted by the USTC team for the Short Text Conversation (STC) task of NTCIR-12. We proposed the transition-p2c, encoder-decoderReverse and joint-Train models for the STC task and submitted 5 official runs. The transition-p2c model provides the transition probability between post and comment at the word level, which complements the TF-IDF feature. The encoderdecoder...

BUPTTeam Participation in NTCIR-12 Short Text Conversation Task

This paper provides an overview of the BUPTTeam system that participated in the Chinese Short Text Conversation (STC) task at NTCIR-12. STC is a new and challenging NTCIR task defined as an IR problem, i.e., retrieval based on a repository of post-comment pairs from Sina Weibo. In this paper, we propose a novel method to retrieve post result from the repository based on the following four...

Publication date: 2016